[TOPI] Alleviate hanging issue caused by concat op #3268

Laurawly · 2019-05-31T07:23:26Z

@tqchen This is a temporary solution to alleviate the problem, will work on the IR builder to write the schedule for concate op.

tqchen · 2019-05-31T16:54:00Z

cc @jroesch @wweic the CI problem could relate to limitations in VM

masahi · 2019-05-31T22:29:56Z

operator fusion tests are heavily dependent on concat being fusible as injective. If we want to make concat opaque, corresponding updates to fusion tests are needed.

If this is indeed a temporary solution, I suggest just disable fusion tests that involve concat.

jroesch · 2019-06-01T07:52:17Z

I have a patch for the VM issue which is almost complete, will post tomorrow sometime.

kevinthesun · 2019-06-01T18:00:08Z

Will this change affect the performance of concat for cpu？

Laurawly · 2019-06-02T18:58:17Z

Will this change affect the performance of concat for cpu？

Previously ssd_mobilenet1.0_512 cost 1.4437s on arm CPU now it costs 1.4545s.

kevinthesun · 2019-06-03T18:15:29Z

I tested on x86 cpu with a set of gluoncv models. This change doesn't affect the performance.

apivovarov · 2019-06-06T01:10:32Z

This fix solves 10-15 min hanging issue for the first inference
But it causes 20% performance degradation for the second and consequent inferences on GPU.
I tested it for GluonCV ssd_512_mobilenet1.0_voc model on Mali GPU OpenCL (RK3399)
The second and consequent inferences take:
before the fix: 420ms
after the fix: 520ms

Laurawly · 2019-06-06T02:34:46Z

This fix solves 10-15 min hanging issue for the first inference
But it causes 20% performance degradation for the second and consequent inferences on GPU.
I tested it for GluonCV ssd_512_mobilenet1.0_voc model on Mali GPU OpenCL (RK3399)
The second and consequent inferences take:
before the fix: 420ms
after the fix: 520ms

The performance reported by tvm evaluator with hanging issue was 470 ms on Mali GPU RK3399. But the hanging time is more than 10 minutes which composes most of the end-to-end time. And by this fix, the hanging issue is alleviated, and performance evaluated by tvm time evaluator is 517 ms. You are more than welcome to fix the hanging issue without performance lost.

Laurawly · 2019-06-06T04:10:58Z

operator fusion tests are heavily dependent on concat being fusible as injective. If we want to make concat opaque, corresponding updates to fusion tests are needed.

If this is indeed a temporary solution, I suggest just disable fusion tests that involve concat.

@tqchen Shall we disable fusion tests that involve concat?

hlu1 · 2019-06-07T01:06:29Z

@apivovarov, making Concat opaque allows you to write fast schedules specially for concat, for instance, https://github.com/dmlc/tvm/blob/master/topi/python/topi/x86/injective.py#L53 for x86 concat schedules. Note that the concat schedule is only guaranteed to be used only when concat is an opaque or output_elem_fusable op. The default injective schedule would be used when concat is an injective op and fused together with other injective ops.

tqchen · 2019-06-07T18:31:12Z

Some related followup thoughts https://discuss.tvm.ai/t/explore-optimizations-for-concat/2435/7

tqchen · 2019-09-13T20:25:45Z

Close this for now due to inactive status, @sxjscience has volunteered to continue working on the thread https://discuss.tvm.ai/t/explore-optimizations-for-concat/2435/7

sxjscience · 2019-09-13T21:08:42Z

@tqchen I'm still working on that. Need to wait for some more time...

jroesch mentioned this pull request Jun 4, 2019

[Relay][VM] Fix code generation for packed functions + tuples #3287

Merged

Laurawly force-pushed the dev branch from 5302d8d to b029db3 Compare June 6, 2019 02:48

icemelon added the status: need update need update based on feedbacks label Jun 18, 2019

Laurawly added 2 commits September 5, 2019 11:34

alleviate hanging issue caused by concat op

cb48ec0

typo

e041a90

Laurawly force-pushed the dev branch from b029db3 to e041a90 Compare September 5, 2019 18:35

tqchen closed this Sep 13, 2019

tqchen added status: inactive and removed status: need update need update based on feedbacks labels Sep 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TOPI] Alleviate hanging issue caused by concat op #3268

[TOPI] Alleviate hanging issue caused by concat op #3268

Laurawly commented May 31, 2019

tqchen commented May 31, 2019

masahi commented May 31, 2019 •

edited

Loading

jroesch commented Jun 1, 2019

kevinthesun commented Jun 1, 2019

Laurawly commented Jun 2, 2019

kevinthesun commented Jun 3, 2019

apivovarov commented Jun 6, 2019 •

edited

Loading

Laurawly commented Jun 6, 2019 •

edited

Loading

Laurawly commented Jun 6, 2019

hlu1 commented Jun 7, 2019

tqchen commented Jun 7, 2019

tqchen commented Sep 13, 2019

sxjscience commented Sep 13, 2019

[TOPI] Alleviate hanging issue caused by concat op #3268

[TOPI] Alleviate hanging issue caused by concat op #3268

Conversation

Laurawly commented May 31, 2019

tqchen commented May 31, 2019

masahi commented May 31, 2019 • edited Loading

jroesch commented Jun 1, 2019

kevinthesun commented Jun 1, 2019

Laurawly commented Jun 2, 2019

kevinthesun commented Jun 3, 2019

apivovarov commented Jun 6, 2019 • edited Loading

Laurawly commented Jun 6, 2019 • edited Loading

Laurawly commented Jun 6, 2019

hlu1 commented Jun 7, 2019

tqchen commented Jun 7, 2019

tqchen commented Sep 13, 2019

sxjscience commented Sep 13, 2019

masahi commented May 31, 2019 •

edited

Loading

apivovarov commented Jun 6, 2019 •

edited

Loading

Laurawly commented Jun 6, 2019 •

edited

Loading